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Virtual Reality a.k.a Multi-persective rendering 
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Hybrid rendering in games 

Shadows 
Reflections 
Transparency 

Better scaling with multiple dynamic lights 
Real-time light map updates for the rasteriser 
Easy to bolt on 
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Fully ray traced graphics 

Brute force path tracing 
Produces photo realism easily 

Pretty much requires all the 3D content to be ray traced 

Possible in todays technology in console / desktop using Wizard 

Probably not practical for fully real-time use in mobile for a couple 
more generations. Fine for sub-realtime use. 
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Non-graphics 




Better in-game Al 
Collision Detection 
3D spatial search 
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Power and Bandwidth 
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Capability of our mobile graphics 


• Almost 5x compute capability 

• 4x vec4 MAD -► 16x Scalar 2xMAD 






• 1 .5 -► 3.6 Gtexels/s 

• OpenGLES 3.0 Support 
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• An instance is a vertex, pixel or OpenCL thread 

• All instances in the task share Program, Uniforms, Parameters, etc. 
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varying vec2 texCoord; 

void main() { 

if (texCoord.x >0.5) { 


> else { 
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Private registers per instance) 
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Overall capability 


• Series6 gives you over lOOGflops for shading on latest mobile devices 

• OpenGL ES 3.0 support 

• Massive capability gain allowing math heavy shaders 

• Mobile graphics is where the actual innovation is happening 

• Try it! 
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New shader type 

Ray Shaders are invoked when a ray intersects a triangle 
Shaders types can emit any number of rays 
OpenRL has a frame shader to emit primary rays 
Existing fragment / pixel shaders can emit rays also! 

GLSL Programming model is the same 
Built-in functions, parameters, etc. 
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Primitive Objects 


Encapsulate the rendering state for one mesh 
Includes VBOs, uniforms, shader programs, texture bindings, etc. 

Ray tracing unit sorts rays into tasks with common primitive objects 

Persistent between frames 
Mutable objects by the client 
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Constraints / Limitations 



• Shaders cannot wait for results of individual ray trace operations 

• Shaders must provide a worst case estimate on number of child rays 

• Per-ray user data payload must be carefully managed 
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Parallelism is on 


Task 1 


rays NOT pixels 



Common 
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(In this example, ALU width is 4 
to make the diagram smaller) 
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Parallelism is on rays NOT pixels 
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Ray:AABB Test 


“Fast Ray-Axis Aligned Bounding Box Overlap Tests with Plucker 
Coordinates.” - Jeffrey Mahovsky and Brian Wyvill 

6 lines form the silhouette of the AABB 

6 planes from the ray origin and each edge vector 

Dot product of plane normal and ray direction vector 

6 signs must match and be negative. 

CQ Imagination 



idc14 
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Automatically finds coherence paths 
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Limitations 


• Scene is represented by triangles - same as today 

• BVH is in a defined format optimised for construction and traversal 

• Triangle order must generally follow a spatially coherent flow 

• An approximate scene scale estimation is needed 

• Geometry shaders are not inline with the ray tracing pipeline 
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Strengths 

Shading cluster workload is no higher than a vertex shader 
Only needs to process geometry that actually moved in world space 
Unique algorithm constrains working set to internal registers only 
Single pass operation: in-line with vertex shader execution 
Handles the “long skinny triangles” problem well 
Streaming writes to external memory 

Losslessly compressed output formats due to build algorithm 
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Get early access to the programming concepts at: 

http://tinyurl.com/9ddlsv2 




Talk immediately after this on Hybrid rendering 


See the demos at Imagination Booth #402 South Hall 
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